WIND - A Warehouse for Internet Data

نویسندگان

  • Lukas C. Faulstich
  • Myra Spiliopoulou
  • Volker Linnemann
چکیده

The increasing amount of information available in the web demands sophisticated querying methods and knowledge discovery techniques. In this study, we introduce our architectural framework WIND for a data warehouse over a domain-speciic thematic section of the In-ternet. The aim of WIND is to provide a partially materialized structured view of the underlying information sources, on which database querying can be applied and mining techniques can be developed. WIND loads web documents into several complementary local repositories like OODBMSs and text retrieval systems. This allows for a combination of attribute and content-oriented query processing. Special interest is paid to domain-speciic document formats. To support conversion between (semi-)structured documents and database objects, we consider a technique for the generation of format converters based on the notion of object-grammars.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Object Data Warehousing Approach: a Web Site Repository

Internet provides a huge amount of data, but this information cannot be exploited to make decisions. An integrated view over multiple, autonomous and heterogeneous Web sources can be defined as a data warehouse which collects relevant data and keeps history (evolutions) of data to improve decision making. A data warehouse exploits large volumes of data to turn passive information into useful ac...

متن کامل

Improvement of the Analytical Queries Response Time in Real-Time Data Warehouse using Materialized Views Concatenation

A real-time data warehouse is a collection of recent and hierarchical data that is used for managers’ decision-making by creating online analytical queries. The volume of data collected from data sources and entered into the real-time data warehouse is constantly increasing. Moreover, as the volume of input data to the real time data warehouse increases, the interference between online loading ...

متن کامل

ارائه مدل تلفیقی برای ارزیابی آمادگی سازمان ها جهت پیاده سازی سیستم انباره داده با استفاده ازتحلیل سلسله مراتبی

Enterprise Data Warehouse initiative is a high investment project. The adoption of Data Warehouse will be significantly different depending upon the level of readiness of an organization. Before implementation of Data Warehouse system in a firm, it is necessary to evaluate the level of the readiness of firm. A successful Data Warehouse assessment model requires a deep understanding of opportuni...

متن کامل

Building XML Data Warehouse

With the proliferation of XML-based data sources available across the Internet, it is increasingly important to provide users with a data warehouse of XML data sources to facilitate decision-making processes. Due to the extremely large amount of XML data available on web, unguided warehousing of XML data turns out to be highly costly and usually cannot well accommodate the users’ needs in XML d...

متن کامل

Change Detection and Maintenance of an XML Web Warehouse

The World Wide Web contains a huge and increasing volume of information. The web warehouse is an efficient and effective means to facilitate utilization of information on the Web, not only to individual users but also to business organizations, especially for decision-making purposes. On the other hand, XML has recently become the new standard for representation and exchange of data on the Web....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997